Search CORE

17 research outputs found

Recovery Guarantees for Quadratic Tensors with Limited Observations

Author: Charikar Moses
Liang Yingyu
Sharan Vatsal
Zhang Hongyang
Publication venue
Publication date: 31/10/2018
Field of study

We consider the tensor completion problem of predicting the missing entries of a tensor. The commonly used CP model has a triple product form, but an alternate family of quadratic models which are the sum of pairwise products instead of a triple product have emerged from applications such as recommendation systems. Non-convex methods are the method of choice for learning quadratic models, and this work examines their sample complexity and error guarantee. Our main result is that with the number of samples being only linear in the dimension, all local minima of the mean squared error objective are global minima and recover the original tensor accurately. The techniques lead to simple proofs showing that convex relaxation can recover quadratic tensors provided with linear number of samples. We substantiate our theoretical results with experiments on synthetic and real-world data, showing that quadratic models have better performance than CP models in scenarios where there are limited amount of observations available

arXiv.org e-Print Archive

NeuroSketch: Fast and Approximate Evaluation of Range Aggregate Queries with Neural Networks

Author: Shahabi Cyrus
Sharan Vatsal
Zeighami Sepanta
Publication venue
Publication date: 19/11/2022
Field of study

Range aggregate queries (RAQs) are an integral part of many real-world applications, where, often, fast and approximate answers for the queries are desired. Recent work has studied answering RAQs using machine learning (ML) models, where a model of the data is learned to answer the queries. However, there is no theoretical understanding of why and when the ML based approaches perform well. Furthermore, since the ML approaches model the data, they fail to capitalize on any query specific information to improve performance in practice. In this paper, we focus on modeling ``queries'' rather than data and train neural networks to learn the query answers. This change of focus allows us to theoretically study our ML approach to provide a distribution and query dependent error bound for neural networks when answering RAQs. We confirm our theoretical results by developing NeuroSketch, a neural network framework to answer RAQs in practice. Extensive experimental study on real-world, TPC-benchmark and synthetic datasets show that NeuroSketch answers RAQs multiple orders of magnitude faster than state-of-the-art and with better accuracy.Comment: Conference paper in SIGMOD 2023. arXiv admin note: text overlap with arXiv:2107.0492

arXiv.org e-Print Archive

Mitigating Simplicity Bias in Deep Learning for Improved OOD Generalization and Robustness

Author: Shahabi Kameron
Sharan Vatsal
Vasudeva Bhavya
Publication venue
Publication date: 09/10/2023
Field of study

Neural networks (NNs) are known to exhibit simplicity bias where they tend to prefer learning 'simple' features over more 'complex' ones, even when the latter may be more informative. Simplicity bias can lead to the model making biased predictions which have poor out-of-distribution (OOD) generalization. To address this, we propose a framework that encourages the model to use a more diverse set of features to make predictions. We first train a simple model, and then regularize the conditional mutual information with respect to it to obtain the final model. We demonstrate the effectiveness of this framework in various problem settings and real-world applications, showing that it effectively addresses simplicity bias and leads to more features being used, enhances OOD generalization, and improves subgroup robustness and fairness. We complement these results with theoretical analyses of the effect of the regularization and its OOD generalization properties.Comment: 28 pages, 10 figures, 16 table

arXiv.org e-Print Archive